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POPULAR SUMMARY 
Submitted to Journal of Climate 


A number of satellites presently in orbit, including the Tropical Rainfall 
Measuring Mission (TRMM), are producing global maps of rainfall amounts, 
sometimes on a daily basis, sometimes on a monthly basis. The rainfall values 
on these maps have considerable errors in them, partly due to problems with 
remote sensing techniques for measuring rain, and partly because the satellite 
doesn't view each spot on the earth continuously. The latter kind of error is 
referred to as "sampling error," because the maps are derived from occasional 
samples or snapshots taken by the satellite as it orbits the earth. 

There have been many studies attempting to provide quantitative estimates of 
how big sampling error might be for each rainfall value at each location on a 
satellite rainfall map. This paper is a significant contribution to this effort 
because it uses radar data from a large section of the U.S. (similar to the radar 
data displayed on weather channels) to make many estimates of what the 
sampling error in satellite rainfall maps should be for thousands of different 
cases. It uses a fairly straightforward, common-sense method to do this, but it 
requires substantial amounts of computer time to produce these values. A 
simple formula that seems to predict the sampling error with good accuracy is 
found that predicts the sampling errors quite well, so that sampling errors can 
be estimated in situations when radar data are unavailable or the computer 
analysis cannot be done. 

The paper also compares the sampling errors obtained with the straightforward 
method to sampling-error estimates obtained using some theoretical ideas 
developed in earlier papers by T. L. Bell and P. K. Kundu. The two estimates 
compare very well, thus providing some support for extending the theoretical 
predictions to areas where there are no radar data, such as in undeveloped 
countries and over the oceans. 
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Abstract 


The uncertainty of rainfall estimated from averages of discrete samples collected by a satellite is 
assessed using a multi-year radar data set covering a large portion of the United States. The 
sampling-related uncertainty of rainfall estimates is evaluated for all combinations of 100 km, 
200 km, and 500 km space domains, 1 day, 5 day, and 30 day rainfall accumulations, and regular 
sampling time intervals of 1 h, 3 h, 6 h, 8 h, and 12 h. These extensive analyses are combined to 
characterize the sampling uncertainty as a function of space and time domain, sampling 
frequency, and rainfall characteristics by means of a simple scaling law. Moreover, it is shown 
that both parametric and non-parametric statistical techniques of estimating the sampling 
uncertainty produce comparable results. Sampling uncertainty estimates, however, do depend on 
the choice of technique for obtaining them. They can also vary considerably from case to case, 
reflecting the great variability of natural rainfall, and should therefore be expressed in 
probabilistic terms. Rainfall calibration errors are shown to affect comparison of results obtained 
by studies based on data from different climate regions and/or observation platforms. 
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1. Introduction 




Monitoring rainfall on a global scale is key to a quantitative understanding of the global 
hydrologic cycle and our climate system. Observations from spacebome platforms offer global 
coverage, albeit with limited sampling in space and time depending on the satellite’s orbit and 
instrument configuration. This limitation in sampling frequency, in combination with the 
intermittence of rainfall in space and time, causes satellite-based rainfall estimates to be 
uncertain. In this study, the sampling-related uncertainty o E is assumed to be a function of the 
rainfall rate R , the domain size A , the time integration T , and the sampling time interval At ; 
that is 


°E=f\ 


J_ J_ At" 

r’a’t/ 


( 1 ) 


Studies such as North and Nakamoto (1989), Bell et al. (1990), Steiner (1996), Bell and Kundu 
(2000), and Bell et al. (2001), using ground-based rainfall data, have shown that this uncertainty 
is expected to decrease for higher rainfall rates, larger domain sizes, and longer time integration. 
This has also been seen in studies using satellite data, for example, by Chang et al. (1993), Weng 
et al. (1994), Berg and Avery (1995), and Chang and Chiu (2001). On the other hand, increasing 
the sampling time interval (i.e., reducing the sampling frequency) will result in a larger 
uncertainty. A recent survey of sampling uncertainty for various geophysical parameters is 
provided by Astin (1997). 

Using a multi-year data set of continental-scale, radar-based rainfall observations over the 
United States east of the Rocky Mountains, the sampling-related uncertainty of averages of 
observations made at regular time intervals is studied in depth. Irregularities in the space-time 
sampling pattern (e.g., Salby 1982a, b; Wunsch 1989; Chelton and Schlax 1991; Wu et al. 1995; 
Zeng and Levy 1995; Negri et al. 2002) and issues of rainfall retrieval accuracy (e.g., Wilheit 
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1988; Bell et al. 1990, 2001; Ha and North 1995) or combination of observations from multiple 
satellite platforms (e.g., Shin and North 1988; North et al. 1993; Bell and Kundu 1996) are not 
considered as part of this analysis. In particular, the sampling-related uncertainty is evaluated as 
a function of typical space and time domains, sampling frequency, and the rainfall intensity. 

The present analyses go beyond what previous studies have achieved in at least two major 
ways: (1) an extensive data base is explored in depth and (2) two distinctly different approaches 
of estimating the sampling-related uncertainty are compared. Moreover, an attempt is made to 
characterize the accuracy of such uncertainty estimates. 

This study thus aims at quantifying the uncertainty (often dubbed sampling error ) of 
remotely-sensed rainfall estimates based on discrete sampling in space and time. The results will 
provide guidance for interpretation of rainfall estimates from satellites, such as the Tropical 
Rainfall Measuring Mission (TRMM) satellite (Simpson et al. 1988; Simpson et al. 1996; 
Kummerow et al. 1998) or the Advanced Microwave Sounding Units (AMSU) flown aboard the 
current operational National Oceanic and Atmospheric Administration (NOAA) polar-orbiting 
satellite series (e.g., Kidder et al. 2000; Ferraro et al. 2002), and planning of future satellite 
missions, such as the Global Precipitation Measurement (GPM) mission. 

2. Analysis procedures and data 

a) A framework for estimation of sampling uncertainty 

There are at least two different statistical approaches to estimating the sampling-related 
uncertainty of rainfall: parametric methods (with stochastic space-time rainfall model parameters 
fitted to data) are contrasted by non-parametric, purely empirical methods (based on subsampling 
scenarios). These latter methods typically build on resampling by shifts techniques based on 
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high-resolution rain gauge and/or radar data (e.g., McConnell and North 1987; Steiner 1996; Li 
et al. 1996). A framework is developed here that enables direct comparison of the two 
approaches. 

In the study of sampling uncertainty by Laughlin (1981), a satellite is assumed to make its 
first observation at t = 0, subsequent observations at regular intervals of At, and its last 
observation at t = T. The resampling by shifts method of estimating the sampling-related 
uncertainty assumes instead that the simulated satellite observations begin at an arbitrary time t 0 

with 0<t 0 < At . Laughlin’ s approach, however, can easily be modified to accommodate 

arbitrary starting times within the averaging interval [0,r], as summarized below. Except for the 
starting time, the assumptions are the same as in Laughlin (1981): the satellite sees an area A 
(all of it) at intervals At during a time period T . Sampling begins at starting time t 0 , and a total 

of n = Tj At samples are collected. Regardless of the starting point t 0 , the true average rainfall 
is defined to be 

R=j]x A (l)dl, (2) 

* 0 

while the sample average, with starting time t 0 , is 

*(0 = -IX('o+AAO- (3) 

n k-0 

R a ( t ) is the instantaneous rain rate at time t averaged over a grid box with area A . The error in 
the sample average due to the discrete sampling for a particular starting time t 0 is 

e(t 0 )=S(t 0 )-R. (4) 
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The resampling by shifts method obtains an estimate of the mean-squared sampling uncertainty 
c\ from the average of e 2 (t 0 ) over all possible values of t 0 in the interval 0 < t 0 < At , which 
may be denoted as 

<7f=( £ ’ (»„))„• (5) 


Using the same statistical assumptions as Laughlin (1981), an estimate of (5) is derived in the 
Appendix, with the result 






(A t/r A )+ 



(A t/r A ) 


( 6 ) 


with 

c x (z) = coth(z/2) — 2/z (7) 

and 


c 2 





( 8 ) 


Here, o\ is the variance and t a the correlation time (i.e., 1/e -folding time of the 
autocorrelation) of the instantaneous area-average rain rate R A {t). A term of order (A t/T) 2 , 
which depends on t 0 and that was neglected in the approximation given in Bell et al. (1990), has 
been included in (6). A term of order exp(-T/t A ) has been omitted from (6); however, it is 
typically small and can be neglected. Shin and North (1988), Bell and Kundu (1996, 2000), and 
Bell et al. (2001) provide additional background for the derivation of Eq. (6). 

Equation (6) predicts that a E should be approximately linear in At for small At , because a 
power series expansion of (6) gives 
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( 9 ) 


_2 _ 2 
a E ~ a A 


1 1 


6 x a T 6T 2 


(A tf- 


360 x\T 120 t 2 a T 2 j 


(a,) 4 + . 


For T » T A (typically t A ~ 3-8 h), this can be simplified to 


a A A t 
g e ® ~ 




( 10 ) 


Thus (J E is linear in At for small At , and because the next order correction term in (9) is fairly 
small, the linearity may persist over a substantial range of values of At . As At becomes large 
compared with the correlation time t a , however, (6) predicts that a E should begin to scale more 

like -s/A t . 

It should be noted that this linearity in At is a consequence of Laughlin’s (1981) assumption 
that the autocorrelation of the area-average rain rate behaves like an exponential e'^ for small 

lags r . An autocorrelation that didn’t drop off so quickly for small lags, as , for instance, 

would lead to sampling uncertainty increasing as a higher power of At . As we will see later, the 
data exhibit an approximately linear dependence on At over the sampling frequency range 
investigated, suggesting that the autocorrelation of R A (t) may be roughly exponential. 

For T small enough, there is the possibility that Eq. (6) might predict deviations from simple 
proportionality to 7’" 0 5 . As we will see later, however, even for T as small as 1 day the 
deviations from the inverse-square-root scaling are small. 

How does <J E depend on the area A ? This is not quite as easy to assess, because the 
dependence of a E on A is governed both by the dependence of a A and of t a on A . For large 
A, assuming that the spatial and temporal correlation of rain events decreases rapidly for 
sufficiently large space and time separations, it is likely that a A ~ A^' 5 and that x A may become 
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independent of A ; thus, for large space domains a E may be proportional to A~°' 5 . For small A , 
however, this does not need generally be the case. In radar data collected during GATE, the 
Global Atmospheric Research Program (GARP) Atlantic Tropical Experiment (Kuettner et al. 
1974), for example, a behavior like a\ ~ 25^"° B mm 2 /h 2 and r A ~ 0.39.4 0 ' 26 h is seen (Bell 
1987; Bell et al. 1990), where A has units of km 2 . The dependence of a E on A for small A , if 
the fits to GATE statistics are to be believed, is thus approximately ~ A~° 3 according to (10). 

b) Data and analysis procedures 

The analyses of the sampling-related uncertainties are based on a multi-year, continental- 
scale, merged radar data product provided by Weather Services International (WSI) Corporation 
at a resolution of approximately 2 km in space and 15 min in time. Radar reflectivity of this 
product comes at 16 discrete levels. For the purpose of our analyses, the radar reflectivity factor 
Z was converted to rainfall rate R using a hail threshold of 55 dBZ and a gauge-adjusted 
Z = 6007? 14 relationship. A more detailed description and different use of this data product may 
be found in Carbone et al. (2002). Issues about the radar rainfall estimation are extensively 
discussed in Steiner et al. (1999) and references therein. 

This data set may not reflect the true rainfall that occurred at any given point in space and 
time; however, it provides a most realistic representation of rain variability over the continental 
United States east of the Rocky Mountains. The gauge-adjustment resulted in essentially 
unbiased radar rainfall estimates, as shown by Fig. 1. The analyses of the sampling-related 
uncertainty are thus on good grounds, particularly because they build primarily upon relative 
comparisons rather than absolute values, as detailed below. 
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The analyses discussed here are focused on the summer months June 1999 (Fig. lc), July 
2000 (Fig. Id), August 1997 (Fig. le), and September 1998 (Fig. If). These months were 
selected to represent data from various months and years, and to have minimal data gaps (less 
than 3 rainfall maps missing in total). Data gaps were filled by linear interpolation between time 
steps for each grid point individually. The present study domain spans approximately 35 N to 45 
N in degrees longitude and 80 W to 100 W in degrees latitude (Fig. la). Roughly speaking, this 
domain covers the area in between the Rocky Mountains (to the west) and the Appalachian 
Mountains (to the east), and reaches from Texas (in the south) to the Great Lakes (in the north). 

The study area was divided into squared domains with side length L (L = A 1 * * * 05 ) of 500 km (6 
domains), 200 km (48), or 100 km (192), respectively, and rainfall observations were integrated 
over time periods T of 30 days (1 period), 5 days (6), or 1 day (30) for our analyses. The 
sampling-related uncertainty was assessed for sampling time intervals At of 12 h, 8 h, 6 h, 3 h, 1 
h, and 15 min (full resolution), respectively. Analyses were carried out for all combinations of 
domain size, time period, and sampling frequency for all four months investigated. 

1) Approach BASED ON RESAMPLING BY SHIFTS 

The basic analysis procedure is that of a subsampling exercise to determine how much 
uncertainty is typically present in rainfall estimates, as a function of the frequency of sampling. 

The rainfall for a given time period is estimated from samples obtained at regular time intervals, 
assuming that each sample is representative of what occurred during the unobserved interval 

around it. All possible sampling scenarios based on the 15 min data and the selected sampling 
time interval are analyzed (by shifting the start time) and comparing the sample average to the 

rainfall estimate based on using all samples, as outlined in section 2a. 
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The sampling-related uncertainty a E , estimated as the standard deviation of the rainfall 
estimates obtained by successive shifts of the start time, is expressed relative to the true average 
rainfall as 



V var kK' 0 )J 


R 


( 11 ) 


where var[f(? 0 )] is the variance of rainfall errors e(t 0 ) as defined in (4), over all possible shifts 
in the starting time t 0 . This variance typically increases with decreasing sampling frequency. 

The resampling by shifts procedure has been employed in numerous studies (e.g., McConnell 
and North 1987; Steiner et al. 1995; Soman et al. 1995; Steiner 1996; Li et al. 1996). Steiner 
(1996), for example, used this methodology to estimate the sampling-related uncertainty of 
surface rainfall based on extensive rain gauge information. Using radar data, these analyses were 
subsequently expanded by Steiner and Houze (1998) to examine the sampling uncertainty of the 
entire three-dimensional structure of rainfall. 


2) Approach based on Laughlin and Bell 

The sampling-related uncertainty is estimated based on the Laughlin-Bell approach according 
to Eqs. (6), (7), and (8) described in section 2a. The key rainfall parameters are the variance a 2 A 
and time correlation t A of the area-average rainfall rate time series R A (t). In order to see how 
important the c 2 -term in Eq. (6) is, two different estimates are computed: one based on using the 
c x -term only in (6) — i.e., setting c 2 = 0 as in Bell et al. (2001) — and the other based on using 
both terms c, and c 2 . The sampling-related uncertainties are expressed relative to the true rain 
rate as o e /R . 
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Figures 2-4 show the distributions of three key parameters for the four-month data set: the 
space and time domain-average true rain rate R (Fig. 2), the variance in time of the 
instantaneous area-average rain rates (Fig. 3), and the time correlation r A of the instantaneous 
area-average rain rates (Fig. 4). 1 The number shown in the bottom right comer of a panel 
indicates the sample size contained in that distribution. The distributions are normalized using 
their respective sample size. The maximum value of a given distribution is shown in the upper 
right comer of the panel. 

The rain rate distribution significantly widens with decreasing time integration, but also with 
decreasing domain size (albeit not as quickly), as shown in Fig. 2. This is expected because 
reduced levels of averaging will retain peak rain rate values more easily. The maximum space 
and time domain-average rain rate increases from approximately 0.3 mm/h for the 500 km and 
30 day setting (top left panel in Fig. 2) to roughly 3.4 mm/h for the 100 km and 1 day 
configuration (bottom right panel). To put this in perspective, a rainfall of 0.3 mm/h intensity 
accumulates approximately 7 mm per day and 220 mm per month. At 3.4 mm/h more than 80 
mm of rain (average over a 10,000 km 2 area) are generated in a day. The four-month data set 
thus comprises a representative range of mean rain rates. 

The variance a\ of the area-average rain rate trace increases rapidly both with decreasing 

space and time domain, as shown in Fig. 3. The time correlation r A of the area-average rain rate 
trace, however, exhibits a rather different behavior (Fig. 4). Although the maximum of r A 
appears to be similar for given time periods, independent of the space domain, the bulk of the 

1 The correlation time T A of the instantaneous area-average rain rate is determined as the 1/e -folding time of the 
autocorrelation. The autocorrelation function is obtained by dividing all covariances by the geometric mean of the 
corresponding variances. The covariance function is estimated by summing the lagged products and dividing by the 
length of the time series. The S-Plus software package was used, which is available from Statistical Sciences, Inc., 
Seattle, Washington 98109. 
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distribution of correlation times clearly shifts to smaller values with decreasing space and time 
domains. A bimodal distribution with typical values of r A ~ 5.5 and 8.5 h is seen for monthly 
rainfall on a 500 km domain. Much shorter time correlations (< 3 h) are observed for daily 
rainfall on 100 km domains. 

The coefficient of variation cr A /R of the area-average rain rates is directly proportional to 
the sampling uncertainty, as can be seen from (10). This rainfall parameter, shown in Fig. 5, will 
be used later in the discussion of results (section 3c). 

3. Results and discussion 

a) Characteristics of rainfall sampling uncertainty 

The sampling-related uncertainty is estimated for any combination of the various space and 
time domains, and sampling frequencies explored based on the four-month data set. Note that 
the examined data set represents the equivalent of 2 years worth of data for a 500 km domain, 16 
years for a 200 km domain, and 64 years for a 100 km domain, respectively. Moreover, the 
sampling uncertainty is estimated using two distinctly different approaches, as outlined in section 
2b. The results obtained using the resampling by shifts technique are discussed first. (The 
results using the Laughlin-Bell approach will be described in section 3b.) 

Figure 6 summarizes the results of estimating the sampling uncertainty for the nine possible 
combinations based on three space (500 km, 200 km, and 100 km) and three time (30 day, 5 day, 
and 1 day) domains. In addition, within each panel the results for five sampling time intervals (1 
h, 3 h, 6 h, 8 h, and 12 h) are shown. The sampling uncertainty distributions are represented by 
their full range of values (bold solid line), the center 50% of values (outlined box), and the 
distribution median (bold dot). The results shown in Fig. 6 are limited to space and time 


Page 12 


domain-average rain rates R~ 0.1 mm/h (i.e., 0.075 <R < 0.125 mm/h). Results for other 
mean rain rates are presented later. The corresponding sample size (identical for all sampling 
frequencies) is indicated by the number in the bottom right comer of each panel. The dotted line 
(and shaded area) indicates sampling uncertainty estimates (and range of uncertainty) based on a 
fitted scaling law to the data, as will be discussed later as well. 

The sampling-related uncertainty clearly scales with space and time domain, and with 
sampling frequency, as can be seen from Fig. 6. The larger the space and time domain the 
smaller is the sampling uncertainty. Similarly, the higher the sampling frequency (i.e., smaller 
sampling time interval) the smaller is the related uncertainty. However, even for a narrow rain 

rate range of 0.075 <R< 0.125 (nominal R ~ 0.1 mm/h), a very significant range of sampling 
uncertainty is observed. This range of sampling uncertainty is a reflection of the great variability 
of rainfall in space and time. For example, for a TRMM-like sampling of At ~ 12 h, 2 the median 
of the distribution of sampling uncertainty for daily rainfall on a 100 km domain (bottom right 
panel of Fig. 6) is 154%, yet the center 50% of the distribution spans from 116% to 196% 
(extreme values of sampling uncertainty are found as low as 40% and as high as 460%). For a 
GPM-like sampling ( At ~ 3 h), this sampling uncertainty drastically reduces to 43% (median), 
with half the estimates falling within the range of 26%-67%. The sampling-related uncertainties 
for monthly rainfall on a 500 km domain (top left panel), as observed by a TRMM-like satellite 
platform, show a median value of 17%, with the center 50% of values falling between 8% and 
20 %. 


2 Note that at the Equator the true TRMM sampling is closer to At ~ 24 h. For simplicity, however, variable 
sampling intervals as a function of latitude are not considered in this study. 
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Similarly to Fig. 6, the sampling-related uncertainty may be shown for any mean rain rate. 
Rain rates of 0.5 mm/h, 1 .0 mm/h, and 1 .5 mm/h are selected to highlight the scaling of sampling 
uncertainty with rain rate in Fig. 7; however, results are shown for 1 day periods only. No 
samples exhibited mean rain rates of 1 mm/h and 1.5 mm/h, respectively, for daily rainfall on 
500 km domains. Similarly, there were no samples with mean rain rates of 1.5 mm/h for daily 
rainfall on 200 km domains. Nonetheless, the scaling of sampling uncertainty with domain size 
can be seen for all rain rates, and by comparing Figs. 6 and 7 a scaling with rain rate becomes 
apparent. 

In order to quantify the scaling of sampling uncertainty with space and time domain size, 
sampling frequency, and mean rain rate, the distribution medians were determined for all forty- 
five combinations of space (3 options) and time (3) domains, and sampling frequencies (5). The 
median was selected rather than the mean because of its much reduced sensitivity to extreme 

values (outliers). Moreover, this was done for the rain rate range of 0 < R < 3.5 mm/h in steps of 
0.05 mm/h (70 intervals). The resulting large ensemble of distribution medians was then used to 
fit the coefficients a , b , c , d , and e of the following simple sampling uncertainty scaling law 



by minimizing the root-mean-square (RMS) difference between the predicted uncertainty (12) 
and the corresponding median value, using R 0 = 1 .0 mm/h, L 0 = 500 km, T 0 = 30 day, and 

A t 0 = 1 h, respectively. 3 In addition, sensitivity tests were performed to assess the robustness of 
the coefficient values (Table 1). In particular, we assessed the variability of the coefficients from 

3 The multiplicative factor a was adjusted by means of removing the mean difference (bias), while the coefficients 
b , c, d , and e were determined iteratively (in steps of 0.05) to find the minimum RMS difference. 
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month to month, and also using only medians that were based on distributions containing 
minimally 1, 5, 10, or 15 samples. By increasing the minimum number of samples required in a 
given distribution, the respective median values are thought to become more representative and 
thus given priority in the fitting procedure. 

Based on the results summarized in Table 1, we selected a = 0.80, b = 0.20, c = 0.70 , 
d =0.35, and e = 1.05 as the “best-fit” coefficients of (12). These coefficients exhibit some 
variability from month to month and depend on the underlying data constraints; however, overall 
they appear to be rather robust estimates. The exponents (i.e., coefficients b , c , d , and e) may 
be uncertain at the 10% level and the overall prediction of sampling uncertainty at the 25% level, 
based on the results compiled in Table 1 and experimentation with weighted fitting procedures in 
logarithmic space (not shown). 

Equation (12), using the fitted coefficients, displays a scaling of sampling-related uncertainty 
of rainfall estimates that is pretty much linear in sampling time interval At ( e = 1.05) — at least 
for the range 0 < At < 12 h investigated (see Figs. 6 and 7) — similar to the results obtained by 
Steiner (1996) or Li et al. (1996), and as predicted by Eq. (10). A linear scaling in At suggests 
that the autocorrelation of the area-average rain rate should decrease roughly exponentially, as 
originally assumed by Laughlin (1981). This linearity predicted for small At , however, depends 
only on the small-lag behavior of the autocorrelation, and does not contradict potentially 
different behavior for longer time lags, as is sometimes observed. For example, Rodriguez- 
Iturbe et al. (1998) provide evidence that rainfall observations appear to have a long-range 
memory, which suggests that the scaling with sampling time interval might change for larger At . 
In fact, Weng et al. (1994) show that an approximate linearity in scaling of sampling uncertainty 
for At < 12 h starts to break down for At > 12 h. The scaling of sampling uncertainty with time 
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domain T (d = 0.35) is close to (albeit not quite) the inverse square-root behavior advocated by 
(10). The scaling with space domain size L (c = 0.70) is very similar to what would be 
predicted from GATE I rainfall data, as discussed in section 2a. 

Figure 8 highlights the scaling of sampling uncertainty with mean rain rate R (b = 0.20), 
showing a clear departure from the inverse square-root behavior suggested by simple models 
(e.g., Bell and Kundu 2000). The scaling of sampling uncertainty as R "° 5 is bom out of the 
assumption that variations in total rainfall within an area would primarily be due to variations in 
the number of independently evolving precipitation systems present rather than variations in the 
intensity of the individual system. If domains with more rain tend also to have larger spatial 
extent of rainy areas and/or more intense rain, the R~ 0 5 dependence may be altered. Apparently 
the above assumptions leading to a R~ 0 ' 5 behavior are not applicable here. 

The shaded areas in Figs. 6, 7, and 8 outline the range of 0.75-1.25 times the sampling 
uncertainty estimated by (12) using the median-fitted coefficients (dotted lines). This 
uncertainty range roughly approximates the center 50% of the sampling uncertainties estimated 
by the resampling by shifts method. 

How well does the scaling law (12) based on the fitted coefficients gauge the sampling- 
related uncertainty using all four months’ worth of data? The visual impression obtained from 
Figs. 6, 7, and 8 suggests that the simple scaling law (12) predicts the sampling uncertainty as a 
function of space and time domain, sampling frequency, and mean rain rate rather well. A closer 
inspection though reveals deviations from the approximate linear scaling in At . For example, a 
scaling with sampling time interval of A t e>l is hinted for 500 km domains, while a scaling more 
like A t e<x is observed for 100 km domains. These nuances become most visible for sampling 
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time intervals of At > 8 h. Recall Fig. 4 that displayed ^<3 h for 100 km domains, which is 
significantly smaller than the s am pling time interval, consistent with the discussion in section 2a. 

Tables 2 and 3 compile the actual mean and RMS differences (in units of %) between 
sampling uncertainties estimated by the resampling by shifts method and predicted by the 
median-fitted 'scaling law (12). The mean differences (Table 2) are typically small (a few 
percent only); mean differences of 10% or larger are found for At >6 h, but compared to the 
values of the corresponding distribution median (Figs. 6 and 7) these differences are mostly still 
relative small. The RMS differences (Table 3), in contrast, show magnitudes comparable to the 
median values for most space and time domains and sampling time intervals of At <6 h; only 
for lower sampling frequencies reduce the RMS differences to a fraction of the respective 
median values. 

These results underline the basic difficulties in estimating sampling-related uncertainties for 
real rainfall situations. In light of the above discussion, and because the estimation methods 
applied are statistical in nature, the derived sampling uncertainties should be expressed in 
probabilistic terms. For example, based on the four month’s worth of data analyses, there is a 
50% chance that the true (yet unknown) sampling uncertainty falls within the range of 0.75-1.25 
times the sampling uncertainty predicted by the median-fitted simple scaling law (12). 

Moreover, in the “real world”, attaching a sampling uncertainty to satellite rainfall averages 
is based on the sample averages themselves, because the underlying true rainfall is unknown. 
This, of course, introduces additional uncertainty that needs to be quantified. Facing this 
problem, however, is beyond the scope of the present study. 
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b) Comparison of two approaches 

It is instructive to compare sampling uncertainties estimated based on approaches other than 
the one described in the previous section. The observed differences will highlight a sensitivity of 
the results to the method applied obtaining them. Here, sampling uncertainties estimated by the 
resampling by shifts technique (non-parametric approach) are contrasted with results obtained by 
the (parametric) Laughlin-Bell approach, first based on using the c x -term in Eq. (6) alone. (The 
results based on also including the c 2 -term are presented later.) Figures 9 and 10 show this 
comparison for the same data as displayed in Figs. 6 and 7, respectively. For clarity of the 
figures, however, the data are shown in a slightly different way: there are fifteen panels for all 
combinations of time periods and sampling frequency, and the results are distinguished in colors 
by domain size (500 km in red, 200 km in green, and 100 km in blue). 

The encouraging outcome of this comparison is that the sampling uncertainties estimated by 
both the non-parametric and parametric statistical approaches agree rather well, independent of 
space and time domain, and sampling frequency, as demonstrated by Figs. 9 and 10. A closer 
look, however, reveals that there is significant variability (and potentially some minor trends) 
among the results that has to be attributed to differences in the way the sampling uncertainty is 
estimated. Interpreting these nuances is not straightforward though and requires further 
evaluation. 

Tables 4 and 5 list the actual mean and RMS differences of the data displayed in Figs. 9 and 
10 to provide some quantitative information about the comparison. The mean difference (Table 
4) between the resampling by shifts and the Laughlin-Bell approaches typically amounts to a few 
percent only (the maximum difference is 12.4%). Most of the time the Laughlin-Bell approach 
tends to predict sampling uncertainties that on average are slightly larger than those obtained by 
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the resampling by shifts method. This may not fully concur though with the visual impression 
obtained from Figs. 9 and 10. The RMS differences (Table 5) vary between approximately 1% 
and 17%. Moreover, the RMS differences appear to scale with space and time domain size, 
sampling frequency, and mean rain rate, similarly to the estimated sampling uncertainties. For 
daily rainfall on a 100 km domain observed by a GPM-like sampling ( At ~ 3 h), the RMS 
difference between sampling uncertainties estimated by the resampling by shifts and Laughlin- 
Bell approach is about 12%-13%, which is significantly less than the sampling uncertainty itself 
(see Figs. 9 and 10) for mean rain rates of 1.0 mm/h or less. For sampling time intervals At ~ 1 
h, the RMS differences are of the same magnitude as the median values of sampling uncertainty 
(Figs. 6 and 7). Especially for longer sampling time intervals, however, the RMS differences 
tend to be a fraction of the sampling uncertainty only. For a TRMM-like sampling ( At ~ 12 h), 
the RMS differences are small compared to the value of the sampling uncertainty for all space 
and time domains examined. 

The RMS difference between sampling uncertainties estimated by the resampling by shifts 
method and the Laughlin-Bell approach, as shown in Table 5, is of comparable magnitude or 
smaller (particularly for At >3 h) than the RMS difference between uncertainties estimated by 
the resampling by shifts method and those predicted by the median-fitted simple scaling law 
(12), compiled in Table 3. The largest differences occur for infrequent sampling (At > 3 h) of 
small mean rain rates on smaller domains (<200 km), where the data-based uncertainty 
estimates agree more closely with each other than to the uncertainties gauged by the simple 
scaling law (12). 

What is the effect of the c 2 -term when estimating the sampling uncertainty using Eq. (6)? 
The effect of including this term in the Laughlin-Bell estimation procedure becomes noticeable 
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for 5 day and particularly for 1 day periods, as highlighted by Fig. 11. Note that no distinction 
was made for different sampling frequencies, because of the small overall effect. The effect 
becomes more apparent with decreasing time rather than space domain. However, the maximum 
difference in sampling uncertainty estimates between using the c l -term only (i.e., setting c 2 = 0 ) 
or using both terms c, and c 2 in (6) was less than 10% for the space and time domains explored. 
This is clearly less than the difference between estimating the sampling uncertainty based on the 
resampling by shifts method and the Laughlin-Bell approach, as displayed in Figs. 9 and 10 and 
gauged by Tables 4 and 5. On a monthly or even weekly basis, therefore, the c 2 -term may 
safely be ignored. 

c) Discussion 

There are numerous studies of sampling uncertainty assessments for satellite-based rainfall 
estimates reported in the literature. Most of these are (a) based on rather limited data samples 
and/or (b) primarily concerned with one particular approach of estimating the sampling 
uncertainty. Notable exceptions to (a) are the studies of Oki and Sumi (1994) and Steiner 
(1996), the former using a large data set of gauge-adjusted radar data over Japan and the latter 
lots of rain gauge data from Melbourne, Florida, and especially Darwin, Australia. An exception 
to (b) is the study of Li et al. (1996), who compared rainfall sampling uncertainties estimated 
based on stationary and non-stationary rainfall models, plus the resampling by shifts method — 
albeit on one month of data from Darwin only. Much research has focussed on assessing the 
uncertainty of rainfall averages as a function of sampling frequency for fixed space and time 
domains. The scaling of sampling-related uncertainty with space and time domains has received 
attention mostly from a theoretical perspective based on stochastic rainfall model assumptions. 
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Similarly, the dependence of sampling uncertainty on rainfall characteristics awaits a thorough 
evaluation based on large amounts of data. 

The extensive analyses presented here provide thus a unique basis for evaluating the 
sampling uncertainty behavior as a function of space and time domains, sampling frequency, and 
rainfall characteristics. Moreover, the results of this study enable comparison to sampling 
uncertainties estimated over a wide range of climatic rainfall conditions and sampling 
configurations. This is achieved by scaling the respective results to a common basis in terms of 
space and time domains, sampling frequency, and rainfall. Before we can do so, however, we 
need to concern ourselves with the problem of rainfall calibration first. A calibration error in 
rainfall could potentially affect comparisons in two different ways: (i) through errors in the 
estimation of the sampling uncertainty and/or (ii) the comparison of sampling uncertainties 
derived for various climatic rainfall regimes or observing platforms. 

A rainfall calibration error will affect both the variance of the area-average rainfall and the 
mean rain rate. Fortunately, however, the relative sampling uncertainty, expressed in terms of 
the ratio of standard deviation divided by the mean, remains unaffected by a rainfall calibration 
error — at least when the calibration error is multiplicative in nature. Similarly, the correlation 
time of the area-average rainfall is not affected as well. Thus, the relative sampling uncertainties 
estimated by both the resampling by shifts and the Laughlin-Bell methods are unaffected by 
calibration error. 

The relative sampling uncertainty needs to be tied to an absolute mean rain rate, however, in 
order to make it comparable to results obtained for different climate regimes or observing 
platforms. This is where the problem of a potential rainfall calibration error may enter. A 
simple back-of-the-envelope calculation shows how much difference a calibration error might 
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cause. Let us assume that <t e /R is the relative sampling uncertainty estimated for a given mean 
rain rate R and fixed space and time domains, and sampling frequency. Moreover, let us 
assume that the sampling uncertainty scales as Eq. (12) suggests, cr E /R =aR~ p . It can be 
shown then that the rainfall calibration error y has no effect on the power factor ft . Because the 
relative sampling uncertainties obtained for two different rainfall calibrations are identical, as 
demonstrated above, 

dR- p = = . (13) 

R jR 

The two multiplicative factors of the above scaling law, therefore, are related through a = ocy p . 
In order to quantify this effect, let us assume a calibration error of y = 2 — radar-based rainfall 
estimates may easily be in error by a factor of two compared to rain gauges, because of 
application of an inappropriate relationship between radar reflectivity and rain rate, or radar 
hardware calibration problems (e.g., Steiner et al. 1999). For a power factor of /? ~ 0.2 , as seen 
in the present analyses (section 3a), the effect of such a calibration error causes a difference of 
15%. Note that Chang and Chiu (2001) and Bell et al. (2001) find /? ~ 0.3 based on several 
years worth of SSM/I and TMI rainfall estimates over tropical latitudes. For a power factor of 
P ~ 0.5 , which appears more typical for rainfall over southern Japan (Oki and Sumi 1994; see 
also Bell and Kundu 1996, 2000) and near Darwin, Australia (Steiner 1996), however, the 
calibration error will result in a 40% difference. In summary, one has to be aware of the rainfall 
calibration problem when comparing results of sampling uncertainties estimated for various 
locations and/or observing platforms. Moreover, for as long as the dependence of ft on rainfall 
characteristics remains unknown, there is an inherent uncertainty with regard to the choice of ft 
when scaling sampling uncertainties to a common basis. 
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Armed with knowledge about these caveats, let us now focus on comparing the results of the 
present analyses with sampling uncertainties estimated for various climatic regions. These 
comparisons will be limited primarily to studies concerned with infrequent but regularly timed, 
flush-visits made by a single satellite, similar to our assumptions. Laughlin (1981), McConnell 
and North (1987), North and Nakamoto (1989), and Nakamoto et al. (1990), all using radar- 
based rainfall data collected during GATE, found sampling-related uncertainties of 8%-10% for 
monthly (30 day) rainfall estimated based on 12 h sampling over a squared domain with side 
length of 280 km. The mean rain rate for GATE Phase I was 0.445 iran/h (Kedem et al. 1990; 
Bell et al. 1990). The present data set does not contain rainfall examples of that intensity over a 
similar space and time domain. However, based on the median-fitted sampling uncertainty 
scaling law (12), the corresponding sampling uncertainty is predicted as 19.2%. Moreover, there 
is a 50% chance (i.e., shaded area in Figs. 6 and 7) that the true sampling error would fall within 
the range 14.4%-24.0%. Both, the GATE rainfall (Hudlow and Patterson 1979) and the rainfall 
data used in this study (Fig. 1) appear to compare favorably with contemporaneous rain gauge 
measurements. Thus, radar-rainfall calibration errors may not explain the difference in estimated 
sampling uncertainty. In addition, the difference, whether the sampling uncertainty is gauged 
based on a scaling law (12) fitted to the results of the resampling by shifts approach (section 3a) 
or the results obtained by the Laughlin-Bell approach (fitted coefficients not shown), amounts to 
a few percent only. The difference in sampling uncertainty estimated for GATE rainfall and 
rainfall in the central United States, therefore, appears to be real and has to be attributed to 
differences in rainfall characteristics. For example, the coefficient of variation <7 a /R of the 
area-average rain rates over a 200 km domain is significantly smaller for GATE rainfall (~1.9) 
than for the data underlying this study (~3), as can be seen from Fig. 5 (see 30 day, 200 km 
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panel). On the other hand, the correlation in time of GATE rainfall ( f A ~ 8 h) appears 
somewhat longer than for the central United States precipitation (Fig. 4). The observed 
differences in rainfall characteristics are consistent with the differences seen in sampling 
uncertainty between analyses based on GATE rainfall and this study. 

Seed and Austin (1990), using radar information of rainfall observed in Florida during 
August 1987, report a sampling-related uncertainty of 22% for the 20 day rainfall accumulation 
over a 425 km domain estimated based on 12 h sampling. The corresponding mean rain rate was 
0.1 mm/h. Using (12) a sampling uncertainty of 22.2% is predicted for a similar configuration, 
with a 50% likelihood of the true value to range within 16.6%-27.8%. This excellent agreement, 
however, is likely to be fortuitous in light of the fact that the radar-based rainfall data used by 
Seed and Austin (1990) have not been calibrated with rain gauges. The coefficient of variation 
of approximately 2.8 and time correlation of 3 h estimated by Seed and Austin (1990) are similar 
to the present analyses. 

For their analyses of sampling uncertainty, Li et al. (1996) used data collected during 
December 1989 through February 1990 as part of the Down Under Doppler and Electricity 
Experiment (DUNDEE; Rutledge et al. 1992) by a radar located near Darwin, Australia. For a 
monthly rainfall accumulation (mean rain rate R ~ 0.1 mm/h) over a 200 km domain, estimated 
based on 12 h sampling, a sampling-related uncertainty of 26% was obtained essentially 
independent of whether this uncertainty was estimated based on a stationary or non-stationary 
model, or the resampling by shifts technique. Present analyses based on using (12) suggest a 
sampling-related uncertainty of 32.7% (± 8.2%) for a similar configuration. This is in fairly 
close agreement, particularly considering that the radar-rainfall data set used by Li et al. (1996) 
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was only roughly calibrated with rain gauge information, and that the coefficient of variation of 
the area-average rain rates was about 2.5 and the time correlation approximately 3.5 h. 

Soman et al. (1995, 1996) provide another set of analyses of radar-based rainfall data 
collected in January and February 1988 at Darwin, Australia. The sampling uncertainties 
estimated for the 18 day (i? ~ 0.28 mm/h, o a /R~\.6\) and 21 day (R ~ 0.43 mm/h, 
<7 a /R ~ 1.47 ) time periods over a domain of roughly 280 km side length were approximately 
32% and 25%, respectively, based on a TRMM-like ( At ~ 12 h) sampling frequency. The 
sampling uncertainty estimates obtained by either the resampling by shifts method (Soman et al. 
1995) or space-time spectral analyses (Soman et al. 1996) were in close agreement. These 
estimates compare also favorably with predictions of sampling uncertainty for similar 
configurations using (12): 25.2% (±6.3%) and 21.9% (±5.5%) for the first and second phase, 
respectively. 

4. Conclusions 

The uncertainty of rainfall estimates obtained from discrete satellite sampling in space and 
time was assessed based on multi-year, continental-scale radar-mosaic data. Uncertainties for 
typical space and time domains, and sampling frequencies were evaluated. The sampling 
uncertainty was investigated for all combinations of 1 h, 3 h, 6 h, 8 h, or 12 h sampling of rainfall 
over 100 km, 200 km, or 500 km domains, and 1 day, 5 day or 30 day accumulations. The 
analyses of four selected summer months represent the equivalent of 2 years worth of analyses 
on a 500 km domain, 16 years on a 200 km domain, and 64 years on a 100 km domain. 
Moreover, a theoretical framework was established that enabled direct comparison of parametric 
and non-parametric statistical approaches to estimating the sampling-related uncertainty. In 
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particular, results based on a statistical methodology with roots in the work of Laughlin (1981) 
and Bell et al. (1990) were contrasted with those obtained by a simple empirical resampling by 
shifts technique. 

The main results of this study may be summarized as: 

• The sampling uncertainty scales inversely with space (L) and time (T) domain size, and 
rainfall (R), but directly with sampling time interval ( At ). The scaling with space and time 
domain, and sampling frequency behaves as anticipated from previous studies. The scaling 
with rainfall, however, deviates significantly from the expected inverse square-root behavior 
as predicted by simple theoretical models, which appeared to account for the results of Oki 
and Sumi (1994) and Steiner (1996). 

• The rainfall sampling uncertainty, expressed as a percentage of the domain-average rain rate, 
may be characterized by a simple scaling law 



(14) 


where R 0 = 1 mm/h, L 0 = 500 km, T 0 = 30 days, and At 0 = 1 h. Although (14) captures the 

main features of the central United States precipitation data, there is significant variability of 
sampling uncertainty about this simple (fitted) scaling law, some of which is certainly 
attributable to the great space-time variability of rainfall. 

• Sampling uncertainties predicted by (14) are statistical in nature and should therefore be 
expressed in probabilistic terms. For example, based on the data examined, there is a 50% 
chance that the true sampling uncertainty may reside within the range of 0.75-1.25 times the 
value estimated by (14). 
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• The results of the parametric Laughlin-Bell and non-parametric resampling by shifts 
approaches to estimating the sampling-related uncertainty agree rather favorably, despite 
some appreciable variability. The differences between the two approaches highlight a 
sensitivity of the estimated sampling uncertainties to the choice of method. 

• A potential calibration error of the rainfall data does not affect the estimation of relative 
sampling uncertainty. However, an absolute calibration of the rainfall data is required in 
order to make the results comparable to other studies based on different climate regions 
and/or observing platforms. Such comparisons are highly sensitive to the accuracy of rainfall 
observations. 

• Comparison among different land-based data sets reveals comparable sampling-related 
uncertainties for rainfall estimates based on discrete observations in space and time. 
Sampling uncertainties estimated for oceanic rainfall (e.g., GATE), in contrast, are somewhat 
smaller. This result is consistent with a larger variability and shorter time correlation of 
rainfall over land than over ocean (e.g., Ricciardulli and Sardeshmukh 2002). 

Additional work is required to evaluate the relationship between rainfall characteristics and 
the power law of the scaling with domain-average rain rate. This will encompass analyses of 
rainfall data collected in a wide variety of climate regions. Moreover, future investigations may 
reveal that, besides the domain-average rain rate, variance and correlation time of the area- 
average instantaneous rain rate, there might be other descriptors of rainfall characteristics 
important for predicting sampling-related uncertainty. 
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Appendix: Details of estimating sampling uncertainty according to Laughlin 

Some details of the derivation of Eq. (6) are given here. The average squared uncertainty 
e 2 (t 0 ) for a particular starting time t 0 is estimated in Laughlin’s (1981) approach by writing the 
ensemble average in terms of the lagged correlations of the area-average rain rate R A {t), starting 
from the definition 

{e% )) = ([*(', )-*f) (Al) 

-([»'<», )-Ff), (A2) 

where R(t 0 ) is the sample average rainfall for starting time t 0 and R is the true mean rainfall as 
given in Eqs. (3) and (2), respectively; and where the primes indicate deviations from the 
ensemble mean, z =z-(z). Equation (A2) expands to 

(£ 2 (<.)> = ([^(<.)f)+([5?)-2{^('.F>. (A3) 

Substituting (3) into the first term of (A3), we obtain 

+iA(fc(< 0 +yA/)), (A4) 

' in 1=0 j= o 

and defining the lagged covariance of R A ( t ) as 

c A (r) = (R A {t + r)R A {t)), (A5) 

which was assumed by Laughlin (1981) to depend only on the separation in time of the two rain 
rates, we can write (A4) as 



(A7) 


= ~T S(«-Wk(?A/), 

n q=-(n- 1 ) 

where we have used the dependence of c A [(/ - j)At] on i and j only through the difference 
i — j to rewrite the double sum in (A6) as a single sum in (A7). Note that (A7) does not in fact 
depend on t 0 . 

Likewise, the second term in (A3) can be written in terms of c A (t) using the definition of R 
in Eq. (2) to obtain 

T T 

M)-^/ icA-t,)*A <A8) 

1 0 0 

= i J( r - Hh (« . ( A9 ) 

1 -T 

where the double integral in (A8) has been reduced to a single integral by taking advantage of 
the integrand’s dependence on the difference in the two integration times. As in the case of 
(A7), (A9) does not depend on t 0 . 

The cross term in (A3), after substitution for R and R{t 0 ) from Eqs. (2) and (3), becomes 

n \ r 

(A (f 0 )A') = -Xvr J ^(/ 0 +/Ar-i,Vf ] . (A10) 

Although more complex than the previous two terms, this can be simplified considerably if we 
take into account at this point the resampling by shifts averaging in Eq. (5), which we 
approximate as a continuous average, 

(*W\ • (All) 

We can then use (A10) and n = Tj At to write (A1 1) as 
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where the last step in (A12) uses (A8). Laughlin’s (1981) approach therefore gives an 
approximate expression for the resampling by shifts average over Eq. (A3) as 

(A13) 

Laughlin (1981) proposed approximating the lagged covariance as an exponential, 

c A {T)=a 2 A e^ /T \ (A14) 

where c\ is the variance and t a the correlation time of the instantaneous area-average rain rate 
R a (t) • If the assumed form of the autocorrelation (A 14) is substituted into (A7) and (A9), some 
straightforward algebra and the summation identities 


n~\ 1 _« 

y><=— 

U l-z 

q z — [« — (« — l)z]z" 
,=o (1-z) 


(A15) 


give the result in Eq. (6). Note that assumptions different from (A14) for the lagged 
autocovariance will result in different expressions for (A13). 
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Table 1. Results of sensitivity analyses for fitting scaling law coefficients to Eq. (12). See text 
for details. 


Data Medians 

Samples 

a 

b 

C 

June 1999 - all 

675 

0.69 

0.20 

0.70 

- min 5 values 

420 

0.80 

0.15 

0.65 

- min 10 values 

315 

1.01 

0.10 

0.65 

- min 15 values 

255 

1.13 

0.10 

0.65 

July 2000 -all 

645 

0.74 

0.25 

0.70 

- min 5 values 

425 

0.82 

0.20 

0.70 

- minimum 10 

310 

1.05 

0.15 

0.65 

- minimum 15 

250 

1.07 

0.15 

0.65 

August 1997 -all 

755 

0.80 

0.15 

0.70 

- min 5 values 

365 

0.75 

0.20 

0.70 

- min 10 values 

270 

0.80 

0.20 

0.65 

- min 15 values 

210 

0.81 

0.20 

0.65 

September 1998 - all 

535 

0.64 

0.25 

0.75 

- min 5 values 

275 

0.97 

0.20 

0.65 

- min 10 values 

210 

1.25 

0.15 

0.65 

- min 15 values 

165 

1.28 

0.15 

0.65 

All four months - all 

845 

0.64 

0.20 

0.75 

- min 5 values 

565 

0.70 

0.20 

0.70 

- min 10 values 

480 

0.79 

0.20 

0.70 

- min 15 values 

410 

0.80 

0.20 

0.70 
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Table 2. Mean difference between sampling uncertainties estimated by the resampling by shifts 
approach and the fitted scaling law (12) using the coefficients a = 0.80, 6 = 0.20, c = 0.70, 
d = 0.35 , and e = 1.05 . The numbers are based on the results displayed in Figs. 6 and 7. 


R 

T 

L 

At = 1 h 

At = 3 h 

At = 6 h 

> 

ii 

oo 

<N 

II 

3 



500 km 

-0.7% 

-2.2% 

-3.8% 

-5.2% 

- 2.7% 


30 day 

200 km 

-1.1% 

-2.6% 

-2.3% 

-0.8% 




100 km 



0.9% 

-1.4% 




500 km 

-1.1% 

-2.5% 

-4.3% 

-5.4% 

0.6% 

0.1 mm/h 

5 day 

200 km 

-1.4% 

-2.5% 






100 km 

-1.4% 

3.0% 

10.5% 

10.5% 

2.9% 



500 km 

-0.2% 

-0.2% 

2.7% 

4.8% 

11.9% 


1 day 

200 km 

-0.2% 

2.8% 

11.0% 

12.9% 

9.2% 



100 km 

0.4% 

8.8% 

11.8% 

6.1% 

-15.9% 



500 km 

-1.4% 

-2.9% 

-5.7% 



0.5 mm/h 

1 day 



-1.8% 

4.2% 

8.5% 

13.2% 



100 km 

-2.8% 

2.6% 

11.5% 

11.5% 

2.2% 



500 km 


• 


- 




200 km 

-1.5% 

-2.9% 

4.8% 

7.0% 

1.4% 





-0.1% 

5.1% 

6.2% 

2.2% 



500 km 

- 

i 

1 

- 

- 

- 

1.5 mm/h 

1 day 

200 km 

- 

MB 

- 

- 

- 



100 km 

-3.4% 

-8.4% 

-10.6% 

-9.4% 

-19.2% 
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Table 3. RMS difference between sampling uncertainties estimated by the resampling by shifts 


approach and the fitted scaling law (12) using the coefficients a = 0.80, b - 0.20, c = 0.70, 
d = 0.35 , and e = 1.05 . The numbers are based on the results displayed in Figs. 6 and 7. 


R 

T 

L 

> 

II 

At = 3 h 

At = 6 h 

> 

II 

OO 

At = 12 h 



500 km 

0.8% 

2.5% 

4.5% 

5.9% 

7.6% 


30 day 

200 km 

1.3% 

3.5% 

6.3% 

8.7% 

11.8% 



100 km 

1.9% 

5.3% 

11.1% 

11.6% 

15.7% 



500 km 

1.4% 

4.1% 

7.8% 

9.1% 

13.3% 

0.1 mm/h 

5 day 

200 km 

2.6% 

7.2% 

16.5% 

19.9% 

27.1% 



100 km 

4.1% 

15.5% 

29.9% 

32.8% 

39.5% 



500 km 

2.8% 

8.3% 

15.6% 

21.3% 

30.2% 


1 day 

200 km 

6.4% 

21.8% 

38.0% 

43.6% 

49.9% 


! 

100 km 

11.7% 

32.8% 

46.8% 

51.6% 

62.1% 



500 km 

1.7% 

3.7% 

8.7% 

14.6% 

19.0% 

0.5 mm/h 

1 day 

200 km 

3.3% 

10.3% 

i 

24.6% 

30.8% 

40.1% 



100 km 

5.2% 

20.9% 

37.5% 

42.2% 

47.3% 



500 km 

" 

I 

- 

- 

" 

1.0 mm/h 

1 day 

200 km 

2.5% 

9.0% 

25.0% 

29.6% 

37.0% 



100 km 

4.2% 

i 

15.4% 

29.6% 

33.4% 

41.0% 



500 km 


- 

- 

- 

- 

1.5 mm/h 

1 day 

200 km 

! 

- 

“ 

- 

" 

- 



100 km 

3.9% 

11.4% 

14.4% 

20.7% 

33.7% 
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Table 4. Mean difference between sampling uncertainties estimated by the resampling by shifts 
and Laughlin-Bell approaches. The numbers are based on the results displayed in Figs. 9 and 10. 


R 

T 

L 

At = 1 h 

At = 3 h 

At = 6 h 





500 km 

-0.7% 

-1.8% 

-2.9% 

- 3.7% 

0.2% 


30 day 

200 km 

-1.5% 




-1.4% 





-2.5% 

-1.1% 

-2.0% 

-0.6% 



500 km 

-2.0% 

-4.6% 

-7.7% 

-9.4% 

-3.8% 

0.1 mm/h 

5 day 



-6.8% 

-6.0% 

-4.1% 

-1.2% 



100 km 

-4.9% 

-5.2% 

-0.2% 

1.0% 

1.3% 





-5.6% 

-5.8% 

-4.4% 

3.9% 


1 day 



- 7.0% 

-1.5% 

1.8% 

6.9% 



100 km 

-5.4% 

- 2.7% 

3.5% 

5.8% 

8.6% 



500 km 

-2.6% 

-5.9% 

jggjj|| 

-12.4% 

-11.5% 

0.5 mm/h 

| 

1 day 

200 km 

-4.5% 

-8.6% 


-1.5% 

7.3% 



100 km 

-7.1% 

-7.5% 

-0.2% 

2.8% 




500 km 

- 

- 

- 

- 

- 

1.0 mm/h 

1 day 

200 km 

-3.5% 

-7.9% 

-2.3% 

0.0% 

-2.0% 



100 km 

-5.6% 

- 7.3% 

-3.0% 

0.6% 

7.5% 



500 km 

- 

- 

- 

- 

- 

1.5 mm/h 

1 day 

200 km 



- 

- 

- 



100 km 

-4.0% 

-8.8% 

- 7.7% 

-2.3% 

0.3% 
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Table 5. RMS difference between sampling uncertainties estimated by the resampling by shifts 
and Laughlin-Bell approaches. The numbers are based on the results displayed in Figs. 9 and 10. 


R 

T 

L 

> 

li 

-s; 

m 

II 

< 

At = 6 h 

-si 

oo 

II 

< 

At = 12 h 



500 km 

0.8% 

2.3% 

3.5% 


U 


30 day 

200 km 

1.8% 

4.2% 

6.7% 


Hi 



100 km 

3.0% 

5.1% 

9.3% 

9.7% 

12.4% 



500 km 

2.2% 

5.3% 

.10.1% 

12.7% 

12.0% 

0.1 mm/h 

5 day 

200 km 

3.9% 

9.8% 

12.8% 

11.4% 

13.1% 



100 km 

6.4% 




13.5% 



500 km 

3.2% 

8.8% 

11.6% 

10.9% 

12.2% 


1 day 

200 km 

6.3% 



11.6% 

12.2% 



100 km 

9.1% 

12.5% 

12.2% 

12.8% 

13.3% 



500 km 


7.8% 

14.1% 

16.5% 

14.6% 

0.5 mm/h 

1 day 

200 km 

5.6% 




mm 



100 km 

9.1% 

13.7% 



11.4% 



500 km 

- 

- 

- 

- 

- 

1.0 mm/h 

1 day 

200 km 

5.7% 

13.2% 

12.5% 

7.9% 

13.6% 



100 km 

7.0% 

12.6% 

11.3% 

11.7% 

14.3% 



500 km 

- 

- 

“ 

- 

- 

1.5 mm/h 

1 day 

200 km 

“ 

- 

- 

- 

“ 



100 km 

5.2% 

11.8% 

12.7% 

12.2% 

9.7% 
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c) June 1999 d) July 2300 



0 100 200 300 400 0 100 200 300 400 0 100 200 300 400 

Raingauge Accumulation [mm] 



Figure 1. Geography and radar-rainfall calibration, (a) Study domain (shaded in gray) covering 
the Great Plains of the United States. The approximate boundaries of this domain are 35 N to 45 
N in degrees longitude and 80 W to 100 W in degrees latitude, (b) Radar-based versus gauge- 
accumulated rainfall for all four months combined, (c)-(f) Radar versus gauge rainfall for June 
1999, July 2000, August 1997, and September 1998, respectively. The ratio of mean radar to 
mean gauge rainfall is indicated in the bottom right comer of a panel. 
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30 day 


5 day 


1 day 



Space and Time Domain ^Average Rain Rates [mm/h] 


Figure 2. Distributions of space and time domain-average rain rate R based on four months of 
data for three averaging areas and three averaging periods. The sample size, indicated by the 
number in the bottom right comer of each panel, is used to normalize the respective mean rain 
rate distribution. The maximum value of each distribution is shown in the top right comer. 








1 00 km 200 km 500 km 


30 day 


5 day 


1 day 



0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 

Variance in Time of A re a- Average Rain Rates [(mm/h) A 2j 


Figure 3. Distributions of the variance g\ of the instantaneous area-average rain rate R A (t) 
based on four months of data. The notation within panels is similar to Fig. 2. 
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Normalized Distribution [%] 




30 day 


1 day 



03 69036 903 6 9 

Correlation in Time of Area-Average Rain Rates [h] 


Figure 4. Distributions of the time correlation t a of the instantaneous area-average rain rate 
R a (f) based on four months of data. The notation within panels is similar to Fig. 2. 
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Figure 5. Distributions of the coefficient of variation a A /R of the instantaneous area-average 
rain rate R A (/) based on four months of data. The notation within panels is similar to Fig. 2. 
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Normalized Distribution [%] 









Rainfall Sampling Uncertainty [%] 


Figure 6. Distributions of the sampling-related uncertainty (determined by the resampling by 
shifts approach) as a function of space and time domain, and sampling frequency. Shown are the 
results for a space and time domain-average rain rate of R ~ 0.1 mm/h based on the four-month 
data set. The full range of each distribution is shown by the solid line, the outlined box indicates 
the center 50% of the values, and the bold dot marks the distribution median. The sample size 
for each distribution (identical for all sampling frequencies shown within a panel) is indicated by 
the number in the bottom right comer. The dotted line is based on the fitted scaling law (12) 
characterizing the sampling-related uncertainty as a function of space and time domain, and 
sampling frequency, with R =0.1 mm/h. The surrounding shaded area marks the predicted 
sampling uncertainty ± 25% of its value. See text for details. 
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Figure 7. Distributions of the sampling-related uncertainty (determined by the resampling by 
shifts approach) as a function of space and time domain, and sampling frequency, similar to Fig. 
6. Shown are the results for 1 day rainfall only, but for increasing mean rain rates of 0.5 (left), 
1.0 (middle), and 1.5 mm/h (right panels). 
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Rainfall Sampling Uncertainty [%] 


Figure 8. Distributions of the sampling-related uncertainty (determined by the resampling by 
shifts approach) as a function of space and time domain, and domain-average rain rate in 
intervals of 0.05 mm/h, as used for the fitting of Eq. (12). Results are shown for a sampling time 
interval of 12 h. The lines, outlined boxes and shaded areas are similar to Figs. 6 and 7. 
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Sampling Uncertainty [%] - Resampling Approach 


Figure 9. Comparison of sampling uncertainty estimated by the Laughlin-Bell (c, term only) 
vs. resampling by shifts approaches for the various explored space and time domains, and 
sampling frequencies. Shown are the results for a space and time domain-average rain rate of 

R~0.\ mm/h based on the four-month data set. The results for the three space domains are 
shown in different colors (500 km in red, 200 km in green, and 100 km in blue). The dotted line 
indicates 1:1 correspondence. 
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Sampling Uncertainty [%] - Resampling Approach 


Figure 10. Comparison of sampling uncertainty estimated by the Laughlin-Bell (c, term only) 
vs. resampling by shifts approaches for the various explored space and time domains, and 
sampling frequencies, similar to Fig. 9. Shown are the results for 1 day rainfall only, but for 
increasing mean rain rates of 0.5 (bottom), 1.0 (middle), and 1.5 mm/h (top panels). 
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Laughlin/Bell Approach - Ratio of Terms 1 +2 / Term 1 


Figure 11. Distributions of the ratio of sampling uncertainty estimated by the Laughlin-Bell 
approach using both terms c, and c 2 in Eq. (6) and using the c, term only. Shown are the 
results for the space and time domains explored based on the four-month data set. No distinction 
is made for different sampling frequencies. The distributions are normalized by their respective 
sample size. 
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